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Abstract —With the explosive growth of mobile demand, small 
cells in millimeter wave (mmWave) hands underlying the macro¬ 
cell networks have attracted intense interest from hoth academia 
and industry. MmWave communications in the 60 GHz hand 
are able to utilize the huge unlicensed bandwidth to provide 
multiple Gbps transmission rates. In this case, devlce-to-devlce 
(D2D) communications in mm Wave bands should be fully ex¬ 
ploited due to no Interference with the macrocell networks and 
higher achievable transmission rates. In addition, due to less 
interference by directional transmission, multiple links Including 
D2D links can be scheduled for concurrent transmissions (spatial 
reuse). With the popularity of content-based mobile applications, 
popular content downloading in the small cells needs to be 
optimized to improve network performance and enhance user 
experience. In this paper, we develop an efficient scheduling 
scheme for popular content downloading in mm Wave small cells, 
termed PCDS (popular content downloading scheduling), where 
both D2D communications in close proximity and concurrent 
transmissions are exploited to improve transmission efficiency. 
In PCDS, a transmission path selection algorithm is designed 
to establish multi-hop transmission paths for users, aiming 
at better utilization of D2D communications and concurrent 
transmissions. After transmission path selection, a concurrent 
transmission scheduling algorithm is designed to maximize the 
spatial reuse gain. Through extensive simulations under various 
traffic patterns, we demonstrate PCDS achieves near-optimal 
performance in terms of delay and throughput, and also superior 
performance compared with other existing protocols, especially 
under heavy load. The impact of the maximum number of hops 
of transmission paths on its performance is also analyzed for a 
better understanding of the role of D2D communications. 


I. Introduction 

Mobile data is growing explosively. Some industry and 
academic experts predict a 1000-fold demand increase by 2020 
m. In order to meet such sharp growth, there is increasing 
interest in deploying small cells in higher frequency bands, 
such as the millimeter wave (mmWave) bands between 30 
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and 300 GHz, underlying the conventional macrocell networks 
to significantly boost the network capacity. This deployment 
is usually referred to as heterogeneous cellular networks 
(HCNs). With huge unlicensed bandwidth (e.g., the 7 GHz 
spectrum between 57 GHz and 64 GHz approved by the 
Federal Communications Commission), small cells in the 60 
GHz band have gained considerable attention from academia, 
industry, and standardization bodies. 60 GHz communications 
enable multi-gigabit data rates, and broadband applications 
like high-speed data transfer between devices (e.g., cam¬ 
eras, ipads, tablets, and notebooks), real-time streaming of 
both compressed and uncompressed high definition television 
(HDTV), wireless gigabit ethernet, and wireless gaming can 
be supported. Recently, rapid progress in 60 GHz mmWave 
circuits, including on-chip and in-package antennas, radio 
frequency power amplifiers, low-noise amplifiers, voltage- 
controlled oscillators, mixers, and analog-to-digital converters 
has paved the way for more cost-effective devices in the 60 
GHz band m, 0, H. Several standards have been defined for 
indoor wireless personal area networks (WPAN) or wireless 
local area networks (WLAN), for example, ECMA-387 0 , 
IEEE 802.15.3c 0, and IEEE 802.11ad Q. 

However, to popularize and standardize products in the 
60 GHz band worldwide, several technical challenges need 
to be addressed. Due to high carrier frequency, 60 GHz 
communications suffer from high propagation loss. The free 
space loss in the 60 GHz band is 21.6 dB worse than 5 
GHz for omnidirectional communications. Thus, directional 
antennas should be synthesized at both transmitter and receiver 
to form directional high gain beams to combat the significant 
propagation loss 0, El, Go). However, with directional 
transmissions, the third party nodes cannot perform carrier 
sensing to avoid contention with the current transmissions, 
which is referred to as “deafness” GD. In addition, mmWave 
links are also vulnerable to blockage by obstacles such as 
humans and furniture due to weak diffraction ability Gl- 
Blockage by a human penalizes the link budget by 20-30 dB. 

On the other hand, content popularity in mobile networks 
has been found to follow the classic Zipf’s law G3l . It is 
demonstrated that a small amount of content accounts for 
the majority of requests, which is very popular among the 
majority of users. In the 60 GHz small cells, popular content 
downloading is widely used in many cases, such as the 
Group/Broadcast communication services like police, fire, and 
ambulance in public safety networks, device discovery, and 
advertising messages broadcasting Gl- With directional trans¬ 
missions, the wireless broadcast channel in 3G/4G networks 
is not feasible in the 60 GHz band G3, Ga. At the same 
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time, in the user-intensive region, there is a high probability 
that two user devices are located near to each other. In 
this case, device-to-device (D2D) communications in physical 
proximity can be exploited for content downloading as well as 
saving power and improving the spectral efficiency ESI. D2D 
communications in the same carrier frequencies as today’s 
cellular systems have significant interference to the cellular 
users, which limits the benefits of D2D communications. 
However, for D2D communications in the mmWave bands, 
there will be no interference with the cellular systems, and 
the higher achievable transmission rates in mmWave bands 
also further increase the benefits of D2D communications. 
Multi-hop D2D communications in mmWave bands can also 
be utilized to overcome blockage by obstacles E2l . ED- In 
addition, in the directional communication scenario, there is 
less interference between links, and concurrent transmissions 
are enabled to unleash the potential of spatial reuse. With these 
fundamental differences between mmWave communications 
and existing systems using lower carrier frequencies (e.g., 
from 900 MHz to 5 GHz), new scheduling schemes or MAC 
protocols for popular content downloading are needed to fully 
reap the benefits of D2D communications in mmWave bands. 

In this paper, we propose an efficient scheduling scheme for 
popular content downloading in mmWave small cells, termed 
PCDS, where D2D communications and concurrent transmis¬ 
sions are fully exploited to improve transmission efficiency. 
In PCDS, users far from the access point (AP) receive the 
popular content from neighboring users in close proximity 
that have received the content. Meanwhile, concurrent link 
transmissions are exploited to significantly improve network 
capacity. The contributions of this paper are four-fold, which 
are summarized as follows. 

• We design a transmission path selection algorithm to 
establish multi-hop transmission paths for users, aiming 
at exploiting better channel conditions (higher transmis¬ 
sion rate) between nodes in close proximity to improve 
transmission efficiency. In addition, the better use of 
concurrent transmissions (spatial reuse) is also considered 
in the algorithm to enhance scheduling efficiency. 

• After transmission path selection, we formulate the op¬ 
timal multi-hop transmission scheduling problem into a 
mixed integer linear programming (MILP), i.e., to mini¬ 
mize the number of time slots to send the popular content 
from the AP to all users. Concurrent transmissions are 
explicitly considered under the signal to interference plus 
noise ratio (SINR) interference model in this formulated 
problem. 

• We propose an efficient and practical concurrent trans¬ 
mission scheduling algorithm to solve the formulated NP- 
hard problem with low complexity. 

• Extensive simulations under various traffic patterns are 
carried out to demonstrate near-optimal performance of 
PCDS in terms of delay and throughput, and the superior 
network performance of PCDS compared with other 
schemes. In addition, we also analyze the impact of 
the maximum number of hops in the transmission path 
selection algorithm on the performance of PCDS, which 


provides references for the choice of this parameter in 
practice. 

The rest of this paper is organized as follows. The related 
work on directional MAC protocols for small cells in the 
60 GHz band is introduced and discussed in Section |II] 
Section |III| introduces the system model, and illustrates the 
procedure and problems of PCDS. Section |IV] presents the 
proposed transmission path selection algorithm to fully exploit 
D2D transmission and spatial reuse. After transmission path 
selection, we formulate the problem, and propose a concurrent 
transmission scheduling algorithm in Section [V] Section [Vl] 
presents the performance evaluation of PCDS under various 
traffic patterns. Finally, we conclude this paper in Section [Vlll 

II. Related Work 

To keep up with surging demand, small cells densely 
deployed underlying the conventional homogeneous macrocell 
network have been proposed to create the dual benefits of 
higher quality links and more spatial reuse m, US). How¬ 
ever, reducing the radii of small cells in the same carrier 
frequencies as today’s cellular systems to reap the spatial reuse 
benefits is fundamentally limited by interference constraints 
ll20ll . By using higher frequency bands, such as the millimeter 
wave (mmWave) bands between 30 and 300 GHz, small cells 
can significantly boost the overall network capacity due to less 
interference with macrocells and higher achievable data rates 

EH, Eo). 

There has been some related work on directional MAC 
protocols for small cells in the 60 GHz band ll2Tll . Il22ll . 
EH, ll24ll . II 25 II . Some work is based on TDMA 0, 0. 
Cai et al. Il22ll derived exclusive region (ER) conditions that 
concurrent transmissions always outperform TDMA. In two 
protocols 1231 . l24l based on IEEE 802.15.3c, concurrent 
transmissions are enabled if the multi-user interference (MUI) 
is below a specific threshold. Qiao et al. ED proposed a 
concurrent transmission scheduling algorithm for an indoor 
IEEE 802.15.3c WPAN, where concurrent transmissions are 
optimized to maximize the number of flows with the quality 
of service requirement of each flow satisfied. Qiao et al. 1^ 
also proposed a multi-hop concurrent transmission scheme to 
address the link outage problem and combat huge path loss 
to improve flow throughput. For the TDMA based protocols, 
unfair medium time allocation problem exists for individual 
users under bursty data traffic 1151 . 

There are also other centralized work on MAC protocols 
for small cells in the 60 GHz band. Gong et al. 1^ proposed 
a directive CSMA/CA protocol, which solves the deafness 
problem by the virtual carrier sensing. However, the spatial 
reuse is not considered. In the multihop relay directional 
MAC (MRDMAC), relay paths are established to steer around 
obstacles ED- However, since most transmissions are through 
the piconet coordinator (PNC), concurrent transmissions are 
also not considered in MRDMAC. Chen et al. iTTl proposed 
a spatial reuse strategy for an IEEE 802.11 ad WPAN, 
where two different service periods (SPs) are scheduled to 
overlap with each other. Since only two links are considered 
for concurrent transmissions, the spatial reuse is not fully 
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exploited. Son et al. US proposed a frame based directional 
MAC protocol (FDMAC), which amortizes the scheduling 
overhead over multiple concurrent transmissions in a row to 
achieve high efficiency. The core of FDMAC is the Greedy 
Coloring (GC) algorithm, which fully exploits spatial reuse 
and greatly improves the network throughput. FDMAC also 
has a good fairness performance and low complexity. Chen 
et al. 12811 proposed a directional cooperative MAC protocol 
(D-CoopMAC) to coordinate the uplink channel access in an 
IEEE 802.1 lad WLAN. In D-CoopMAC, a two-hop relay path 
of high channel quality from the source to the destination is 
established to replace the direct path of poor channel quality 
for higher transmission efficiency. Niu et al. ini proposed 
a blockage robust and efficient directional MAC protocol 
(BRDMAC), which overcomes the blockage problem by two- 
hop relaying. In BRDMAC, relay selection and spatial reuse 
are jointly optimized to achieve robust network connectivity 
and also improve network performance. Niu et al. 12^ also 
proposed a channel transmission rate aware directional MAC 
protocol, RDMAC, where both the multirate capability of 
links and spatial reuse are exploited to improve network 
performance. There are two stages in RDMAC. The first stage 
measures the channel transmission rates of links by a heuris¬ 
tic algorithm, which can compute near-optimal measurement 
schedules with respect to the total number of measurements. 
The second stage accommodates the traffic demand of links 
by a heuristic transmission scheduling algorithm, which can 
compute near-optimal transmission schedules with respect to 
the total transmission time. Recently, Niu et al. proposed a 
joint transmission scheduling scheme for the radio access and 
backhaul of small cells in 60 GHz band, termed D2DMAC, 
where a path selection criterion is designed to enable D2D 
transmissions for performance improvement. 

There are also distributed MAC protocols for small cells 
in the 60 GHz band IIbTI . II^ . The memory-guided direc¬ 
tional MAC (MDMAC) alleviates the deafness problem by 
incorporating a Markov state transition diagram, and employs 
memory to achieve approximate time division multiplexed 
(TDM) schedules ED. The directional-to-directional MAC 
(DtDMAC) solves the asymmetry-in-gain problem with both 
senders and receivers operate in a directional-only mode 
E 2 I . DtDMAC adopts an exponential backoff procedure for 
asynchronous operations, and the deafness problem is also 
alleviated by a Markov state transition diagram. 

To the best of our knowledge, none of the work above 
considers the content popularity in the problem, and we try to 
exploit the D2D transmission in close proximity to enhance 
spatial reuse for popular content downloading in the 60 GHz 
small cells. 

HI. System Overview 

A. System Model 

We consider a mm Wave small cell of n nodes, one of which 
is the AP, and the rest are users (UEs). The system time is 
partitioned into non-overlapping time slots of equal length, 
and the AP synchronizes the clocks of UEs and schedules 
the medium access of all nodes to accommodate their traffic 


demands. With electronically steerable directional antennas 
equipped at the AP and UEs, directional transmissions are sup¬ 
ported between any pair of nodes. In addition, a bootstrapping 
program is run in the system such that the AP knows the up-to- 
date network topology and the location information of other 
nodes ES. El- The network topology can be obtained by 
the neighbor discovery schemes in EH. Location information 
can be obtained based on wireless channel signatures, such 
as angle of arrival, time difference of arrival, or the received 
signal strength E4l . We also assume all nodes are half-duplex, 
and each node has at most one connection with one neighbor 
simultaneously. 

Eor small cells in the 60 GHz band, non-line-of-sight 
(NLOS) transmissions suffer from higher attenuation than line- 
of-sight (LOS) transmissions ESl . E^ . E3- In Ref. ESI, the 
path loss exponent in the LOS hall is 2.17, while the path loss 
exponent in the NLOS hall is 3.01. If the distance between 
the transmitter and the receiver is 10 m, the gap in the path 
loss is about 10 dB for the LOS hall and the NLOS hall. 
When operating in a power-limited regime, a 10 dB power loss 
requires a 10-fold reduction in transmission rate to maintain 
the same reliability. On the other hand, restricting to the LOS 
path can maximize the power efficiency since the LOS path is 
strongest. In addition, NLOS transmissions in the 60 GHz band 
also suffer from a shortage of multipath, and it is reasonably 
accurate to calculate link budget for a directional LOS link 
based on the simple additive white Gaussian noise channel 
model im. Thus, we only consider the Line-of-Sight (LOS) 
transmissions to achieve high transmission rates and maximize 
power efficiency ca. In im, it has been demonstrated that 
relaying using LOS links is able to obtain robust network 
connectivity and high throughput, and multiple schemes have 
been proposed to overcome blockage by multi-hop relaying 

El, ESI. 

We denote the directional link from node i to j by 
and assume the beamforming process between node i and j 
has been completed. According to the path loss model, the 
received power at node j for P[j (mW) can be estimated 
by 

P[^=koGtii,j)Gr{i,j)l^^Pu ( 1 ) 

where Pt (mW) denotes the transmission power, ko is a 
constant coefficient and proportional to (A denotes the 

wavelength), Gt{i,j) denotes the transmit antenna gain of 
node i in the direction from node i to j, Gr{i,j) denotes 
the receive antenna gain of node j in the direction from node 
i to j, lij (m) denotes the distance between transmitter i and 
receiver j, and r denotes the path loss exponent ED- 

Due to directional transmissions, there is less interference 
between links, and concurrent transmissions (spatial reuse) are 
enabled to improve network capacity. However, due to the 
limited communication range, the interference between links 
cannot be neglected. In this paper, we adopt the interference 
model in ED- Then for link (u,u) and the received 

interference power at node j from node u can be calculated 
as 

luvij — pk^Gtiu., j^Griu., j^luj Pt: ( 2 ) 

where p denotes the MUI factor related to the cross correlation 
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of signals from different links ET\ . If we denote the set of 
links that transmit concurrently with link {i,j) by C^, then 
the interference power can be calculated as 

^ ^ ^uvij • ( 3 ) 

{u,v)€:Cij 

Since each node has at most one connection with one neighbor, 
adjacent links cannot be scheduled concurrently CSl. Thus, 
Cij should not include links that are adjacent to link 
The received SINR at receiver j can be calculated as 


P ^ _ koGt{iJ)Gr{i,j)kj ^Pt _ 

NoW + p E koGtiuJ)Gr{u,j)lur"Pt' 

{u,v)^Cij 

where W (Hz) is the bandwidth, and Nq (mW/Hz) is the one¬ 
sided power spectra density of white Gaussian noise ll2Tll . 

For each link (i,j), we denote the minimum SINR to 
support its transmission rate by 7 ( 0 ^). Therefore, link 
(i, j)’s SINR Tij should be greater than or equal to j{cij) 
to support its concurrent transmissions with other links. 

Due to the difference in distance between the transmitter 
and receiver, the channel transmission rates of different links 
vary significantly. We denote the n x n channel transmission 
rate matrix by C, whose {i,j) element is denoted by c^. 
Cij indicates the channel transmission rate of link (i,j), 
and numerically it is equal to the number of packets that 
link {i,j) can transmit in one time slot. We assume that a 
channel transmission rate measurement procedure executes in 
the system to update the channel transmission rate matrix 
123 . This process is summarized as follows; the transmitter 
of each link firstly transmits measurement packets to the 
receiver. Then with the measured signal to noise ratio (SNR), 
the receiver obtains the achievable transmission rate and 
appropriate modulation and coding scheme (MCS) according 
to the SNR and MCS correspondence table. Lastly, the receiver 
will transmit an acknowledgement packet to inform the sender 
of the transmission rate and MCS. Under low user mobility, 
the procedure is executed periodically, and the measurement 
results will be reported to the AR 

In this paper, we consider the popular content downloading 
traffic from the AP to all users. Traditionally, the traffic is 
distributed to each UE from the AP one by one in sequence. 
In PCDS, both D2D transmissions in close proximity and 
concurrent transmissions of links are exploited to improve 
transmission efficiency significantly. In Fig. [T] (a), we give a 
time-line illustration of PCDS, where there are six UEs and 
an AP. In PCDS, time is divided into a sequence of non¬ 
overlapping frames na, and there are two phases in each 
frame, the scheduling phase and transmission phase. In the 
scheduling phase, the AP obtains the packets to be downloaded 
by users from the network layer, which takes time td', then 
the AP computes the content downloading source for each 
UE, and calculates a schedule to distribute these packets to 
all UEs, which takes time tsch\ lastly the AP pushes the 
schedule and selected downloading sources to the UEs in 
sequence, which takes time tpush- In the transmission phase, 
all nodes start transmissions following the schedule until the 


packets are distributed to all UEs. There are multiple pairings 
in the transmission phase, and in each pairing, multiple links 
are activated simultaneously for concurrent transmissions. We 
show an example of PCDS operation in a small cell of six UEs 
in Fig. [H and the more detailed explanation of this example 
can be found in Section IIII-BI 


B. Problem Overview 


To improve transmission efficiency, appropriate transmis¬ 
sion paths from the AP to UEs need to be selected to 
fully exploit the potential of D2D communications, and also 
the benefits of concurrent transmissions. After transmission 
paths are selected, efficient concurrent transmission scheduling 
algorithm should be designed to unleash the potential of spatial 
reuse. 

Now, we consider a small cell of an AP and six UEs, 
whose topology is plotted in Fig.[T](b). We denote the content 
downloading traffic by d, and there are d = 6 packets to be 
distributed to all UEs. The channel transmission rate matrix 
C is 


C = 


/ 0 
1 
1 
2 
2 
1 
3 


1 1 2 2 1 3 \ 
0 1112 3 
10 1112 
110 3 11 
113 0 11 

2 1110 1 

3 2 1110 / 


(5) 


where the first six rows/columns are UEs, and the AP is 
the last row/column. We can observe that the (7,1) element 
of C is 3, which means that the link from AP to UE 1 
is able to transmit 3 packets in one time slot. If we select 
three transmission paths, AP —UEl —UE4 —UE5, 
AP ^ UE2 UE6, and AP ^ UE3 as in Fig. [U (b), 
then we can obtain a schedule to complete traffic downloading 
of all UEs as shown in Fig. [T] (a). This schedule has three 
pairings, and in the first pairing, the AP transmits the packets 
to UE 1 for two time slots. In the second pairing, the AP 
transmits to UE 2, and UEl transmits the packets to UE 4 
for three time slots. In the third pairing, link AP —UES, 
UE2 UE6, and UE4 UE5 are activated to distribute the 
packets to UE 3, UE 5, and UE 6. As we can see, the schedule 
completes traffic downloading in 8 time slots. However, if 
these packets are distributed from the AP to UEs one by one 
without exploiting the D2D communications and spatial reuse, 
we will need at least 25 time slots to distribute these packets 
to all UEs. Since selection of transmission paths has a big 
impact on the efficiency of spatial reuse, we should optimize 
the transmission paths for higher transmission efficiency. Then 
efficient transmission scheduling should also be investigated to 
fully explore the potential of spatial reuse. 


IV. Transmission Path Selection 

In this section, we propose a heuristic transmission path 
selection algorithm to establish multi-hop transmission paths 
for better use of D2D communications and spatial reuse to 
improve transmission efficiency. 
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(a) Time-line illustration of PCDS operation 
Fig. 1. An example of PCDS operation in a small cell of six UEs. 

In the algorithm, the AP and UEs that have received the con¬ 
tent downloading packets are scheduled to be the downloading 
sources of the nearest UEs to fully exploit the advantages of 
D2D communications. To reduce the number of adjacent links 
for better usage of spatial reuse in transmission scheduling, 
each UE is allowed to be the downloading source of one 
nearby UE once. The AP is allowed to be the downloading 
sources of multiple UEs. We denote the set of UEs in the small 
cell by U. We also denote the set of UEs whose downloading 
sources have been selected by Ub, and the set of UEs whose 
downloading sources are not selected is denoted by Uc. When 
lUbI < |Uc|, i.e., there are more UEs that are not selected than 
those UEs that selected, the algorithm reaches out from the 
nodes in U;, to find the UEs in Uc with the largest transmission 
rates to establish downloading links. When jUbj > |Uc|, the 
algorithm reached out from the nodes in Uc to find the UEs in 
Uf, with the largest transmission rates to establish downloading 
links. 

Eor each UE u € U, we define a binary variable to 
indicate whether the downloading source of UE u has been 
selected. If so, bu is equal to 1; otherwise, is equal to 
0. The set of UEs whose downloading sources are selected 
currently is denoted by U*. We denote the set of the selected 
transmission paths by P;,. We also denote the AP in the small 
cell by a. Eor each UE u, we define a binary variable r„ to 
indicate whether UE u has been the downloading source for 
other UEs. If UE u has been the downloading source for other 
UEs, = 1; otherwise, = 0. Eor the AP, is set to 0. 
Eor each path p G Pb, its number of hops is denoted by Hp, 
and its last node is denoted by Ip. We also define a parameter, 
Umax, to denote the maximum possible number of hops for 
each path in Pb. 

The pseudo-code of the transmission path selection algo¬ 
rithm is presented in Algorithm [T] The algorithm iteratively 
schedules UEs into the transmission paths until all UEs are 
scheduled, as in line 3. When |Ut,| < |Uc|, the algorithm ex¬ 
tends the transmission paths in Pf, by searching the neighbors 
of nodes in U;, with the largest transmission rates, as in lines 
5-12. Lines 6-7 obtain the UE with the largest transmission 


^UE3 



rate from the AP to the UE, and generate a new path from the 
AP to the selected UE in P;,. Lines 8-12 extend the paths in Pf, 
to schedule the UEs in Uc into Pf,. All paths in P;, are restricted 
to have at most Hmax hops, as in line 10 and 16. In line 10, the 
condition of = 0 is required since each UE is allowed to be 
the downloading source of one neighboring UE once. In line 
12, the transmission paths in Pj are extended to the UE with 
the largest transmission rate from the last node to the node in 
Uc, and ru is set to 1 since UE u is the downloading source of 
the selected UE. When |Uf,| > |Uc|, the algorithm extends the 
transmission paths in Pf, by searching the neighbors of nodes 
in Uc with the largest transmission rates, as in lines 14-25. In 
lines 14-18, the set of possible downloading sources for UEs 
in Uc is denoted by U^, which is obtained by checking the 
number of hops of paths in Pf, and whether the UE has been 
the downloading source for another UE, as in line 16. In lines 
19-25, the downloading source for each UE in Uc is selected 
by obtaining the node in Ur with the largest transmission rate 
to the UE in Uc. In line 26, U* (the set of UEs that have 
been scheduled into Pf,) is added to Uf, (the set of UEs whose 
downloading sources have been selected), and removed from 
Uc. 

Applying the transmission path selection algorithm with 
Hmax = 3 to the example in Section IIII-BI we obtain the 
transmission paths as path AP —UEl —UE4 —UE5, 
path AP —UE2 UE6, and path AP —UES, which have 
been illustrated in Pig. [T] (b). The computational complexity 
of Algorithm [T] is (9(|Up). 

V. Concurrent Transmission Scheduling 

After transmission path selection, we schedule the down¬ 
loading links for each UE in the transmission phase. The 
content downloading for all UEs needs to be completed with 
the minimum number of time slots to maximize transmission 
efficiency. In this section, we first formulate the optimal multi¬ 
hop transmission scheduling problem into a mixed integer 
program based on the problem formulation in PDMAC (the 
frame-based scheduling directional MAC protocol) Ga, and 
then propose a practical concurrent transmission scheduling 
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algorithm to fully exploit spatial reuse for maximizing trans¬ 
mission efficiency. 

A. Problem Formulation 

We denote the transmission schedule in the transmission 
phase by S, which has K pairings. For each UE u, we denote 
its downloading source by Su, and s„ may be the AP or other 
UEs. The traffic demand to be distributed to all UEs is denoted 
by d. Eor each pairing, we define an n x (n — 1) matrix 
to indicate the links scheduled to communicate in the 
A:th pairing. Since we focus on popular content downloading 
in this paper, where the traffic is from the AP to UEs, the 
links from users to the AP are not considered in matrix 
A^. The rows of A^ indicate the downloading sources of 
UEs, including all UEs and the AP, while the columns of 
A^ indicate all UEs. The (*,j) element of A^, indicates 
whether link {i,j) is scheduled for transmission in the fcth 


Algorithm 1: Transmission Path Selection. 

1 Input: The set of UEs in the small cell U; 

2 Channel transmission rate matrix C; 

3 Initialization: Ub = 0; Uc = U; Ut = 0; Pb = 0; Va = 0; 

4 bu = 0 and r„ = 0 for each u € U; 

5 while |Uc| > 0 do 

6 

7 

8 
9 

10 
11 
12 
13 


14 


15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 
27 


28 


Ut = 0; 

if |Ub| < |Uc| then 

Obtain UE it G Uc with the largest Cau', 
Pb=Pb U {a u}; bu = 1; Ut=Ut U {it}; 
for each it G Ub do 

Obtain the path p G Pb with Ip = it; 


if Tu = 0 and Hj, < 


then 


Obtain UE u G Uc with the largest Cuv 
and by = 0 ; 

Extending p G Pb to it; by = 1; = 1; 

Ut=Ut U {i;}; 


else 


Uc = 0; 

for each p G Pb do 

il (Hp < iFmax and = Oj then 
|_ Uc = Uc U {Ip}; 

Uc = Uc U {a}; 
for each it G Uc do 

Obtain UE u G Uc with the largest Cyu and 

Ty = 0 ; 

if u is a then 

|_ Pb=Pb U (a -)■ It}; bu = 1; Ut=Ub U {it}; 

else 

Obtain the path p G Pb with Ip = v; 
Extending p G Pb to it; = 1; = 1; 

Ut=Ut U {it}; 


Ub = Ub U Ut; Uc = Uc — Ut; 


29 Return 


pairing. If link (i,j) is scheduled for transmission in the 
kth pairing, = 1; otherwise, a{j = 0. We denote the 
number of time slots for the fcth pairing by 5^ ifTSl . To 
maximize transmission efficiency, the transmission schedule 
should complete the traffic downloading for all UEs with 
a minimum number of time slots. Therefore, the objective 

K 

function to be minimized is ^ 5^ ifTSll . Now, we analyze 

fc=i 

the system constraints of this problem. 

Eirst, all UEs can download the traffic once in the schedule, 
which can be expressed as follows. 

K 

= (6) 

k^l 

Second, the schedule should complete the traffic download¬ 
ing for all UEs, which can be expressed as follows. 

K 

k=l 

Third, since UE u obtains the common packets after UE 
Su received the common packets, the downloading of UE 
should be scheduled ahead of the downloading for UE u, 
which can be formulated as follows. This constraint represents 
a group of constraints since K varies from 1 to K, 

K K 

K=lr^K. ( 8 ) 

fc=l fc=l 

Eourth, due to the half-duplex assumption, adjacent links 
cannot be scheduled for concurrent transmissions. Thus, the 
links that share common nodes cannot be scheduled in the 
same pairing, which can be expressed as follows. 


“suU + ^ Ij if {su,u) and {sy,v) are adjacent. (9) 

Einally, to enable concurrent transmissions, the SINR of 
each link in the same pairing should be able to support its 
transmission rate, which can be formulated as follows. 

koGt{Su,u)Gr{Su,u)la^u~'^ ^ , 

WNo+pT,I: koGt(s^,u)Gr-{s^,u)h^u-^Ptai^^ - 

Su V 

V u,k. 

( 10 ) 

If link {su,u) is not scheduled in the fcth pairing, ^ is 0, 
and this constraint does not apply. Otherwise, the SINR of 
link (su,u) should be greater than or equal to jics^u)- 
Therefore, the problem of optimal transmission scheduling 
(PI) can be formulated as follows. 

K 

(PI) min (11) 

k^l 


s. t. Constraints (l^-(fT 0 lt. 
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B. Problem Reformulation 

Since constraints (|7]i and (fTOl i are nonlinear, problem PI is 
a mixed integer nonlinear programming (MINLP), which is 
generally NP-hard. By the Reformulation-Linearization Tech¬ 
nique (RLT) ca, ED, we linearize constraints 0 and (Uni). 
The RLT procedure is used to produce tight linear program¬ 
ming relaxations for an underlying nonlinear and non-convex 
polynomial programming problem. In the RLT procedure, 
for each nonlinear term, a variable substitution is applied to 
linearize the objective function and the constraints. In addition, 
nonlinear implied constraints for each substitution variable are 
generated by taking the products of bounding terms of the 
decision variables, up to a suitable order Ea. 

For the second order term in constraint 0, we define a 
substitution variable = 5^ ■ a' 

Q < 5^ < T, where T = max{ 

0 < Os u < 1, we can obtain the RLT bound-factor product 


^SuU' 

d 


6^ is bounded as 
,V u} Ea. With 


constraints for Q ^ as 


£ >0 

<,SuU — ^ 


- c. > 0 

> 0 

rji ck rji ^k 


T-atu 


V u, k. 


[T-6'^-T-al^ + es^^>0 

For constraint (fTOl t, we first convert it to 


( 12 ) 




k 

Su,U 


> l{Cs^u)P EE koGt{Sy,u)GriSv,u)ls^u Ptas^u(^SvV^ 


V u, k. 


(13) 


For the second order term we define = 

substitution variable. Since 0 < <u < 1 and 
0 < < 1, the RLT bound-factor product constraints for 

are 






SuUS^V — H 


„k _ k 

^SuUSvV 

-| k _ k 

^ ^SuU ^SyV 


> 0 


V' 


, k. 


(14) 


J >0 

"^SuUSvV — ^ 


By substituting £g ^ and Wg „ into constraints 0 and 
Eoli, problem PI can be reformulated into a mixed integer 
linear programming (MILP) as 


K 


k^l 


(15) 


s. t. 


K 


E ■ Cs^u) >d,yu] 




(16) 


{^kQGti^Sut ujGri^Su^ u)ls^u Pt ^ ^SuU 

> l{Cs^u)p EE k^Gti^Sy ^ U)Gy{^Sy ^ U)lsyU Pt^s^US^V^ 

Sv V 

V u, k] 

(17) 

Constraints 0, 0, 0, (fT^ . (fl4l i. 

Considering the example in Section IIII-Bl with the selected 
transmission paths by Algorithm [T] we solve the MILP using 
an open-source MILP solver, YALMIP BOl . The optimal 
schedule consists of three pairings, and completes traffic 
downloading with eight time slots, which has been illustrated 
in Fig. [T] (a). However, problem PI has Cl((|U|)^iT) deci¬ 
sion variables, and C1((|U|)^/C) constraints, and using the 
optimization software to solve the problem takes significant 
computation time, which is unsuitable for mmWave systems 
ESI. Therefore, to implement efficient concurrent transmis¬ 
sion scheduling in practical mmWave small cells, heuristic 
concurrent transmission scheduling algorithms with low com¬ 
putational complexity are needed, which will be constructed 
in the following subsection. 

C. Concurrent Transmission Scheduling Algorithm 

After the transmission path selection by Algorithm [T] we 
propose a heuristic concurrent transmission scheduling algo¬ 
rithm to compute the near-optimal transmission schedules with 
much lower complexity than that of optimization software, 
borrowing the design ideas of the Greedy Coloring (GC) algo¬ 
rithm. Since adjacent links cannot be scheduled concurrently 
in the same pairing, the set of links in each pairing can be 
represented by a matching, and thus the maximum number 
of links in the same pairing is [n/2j ESl . We denote the 
set of links scheduled in the fth pairing by E*, and the set 
of vertices of the links in E* is denoted by V*. Thus, the 
problem of optimal transmission scheduling is to obtain the 
matching in each pairing to complete traffic downloading with 
a minimum of time slots. In each pairing, our algorithm first 
obtains transmission paths with the largest number of hops, 
and then the hop with the largest weight among the first 
unscheduled hops of these paths will be visited first ESI. To 
maximize spatial reuse, the algorithm iteratively allocates as 
many links as possible into each pairing with the concurrent 
transmission conditions satisfied. For the hops on the same 
path, the preceding hops should be scheduled first since each 
UE can be the downloading source for other UEs only after 
it has received the common packets. 

We denote the set of transmission paths selected by Algo¬ 
rithm [T]by Pf,. Eor each path p € P;,, we denote its number of 
hops by Hp. The set of hops in Pj, is denoted by Ef,. We denote 
the fith hop of path p by (p, h), and define its weight Wph as 
the number of time slots to complete traffic downloading. We 
denote the first unscheduled hop on path p by (p, Fp), where 
Fp indicates its hop number. We also denote the transmitter of 
(p, Fp) by SpFp, and the receiver by VpPp. In the tth pairing, 
we denote the set of paths that are not visited yet by P^. 










The pseudo-code of the concurrent transmission scheduling 
algorithm is presented in Algorithmic After obtaining the set 
of the selected transmission paths, Pf,, we obtain the set of 
hops in Pb as Ef,. The algorithm iteratively schedules the hops 
in Eb into each pairing until all hops in Eb are scheduled, as in 
line 6. In each pairing, we first visit the paths with the largest 
number of unscheduled hops, as in line 11. Then the first 
unscheduled hop with the largest weight is visited and selected 
as the candidate hop of this pairing, as in line 12. In line 13, 
the algorithm examines whether the candidate hop is adjacent 
to the hops already in this pairing. If the candidate hop is not 
adjacent to the hops already in this pairing, this candidate hop 
will be added to this pairing to check whether the concurrent 
transmission conditions of this pairing are satisfied, as in line 
14 and lines 15-18. If the SINR of one link in this pairing 
cannot support its transmission rate, the candidate hop will be 
removed from this pairing, as in line 17 and 23. Otherwise, the 
number of time slots of this pairing is updated to accommodate 
the traffic demand of this candidate hop, and this hop is 
removed from Eb in line 19. The visited path is removed from 
in line 24. If the number of links in each pairing reaches 
[n/2j or there is no path unvisited, the algorithm will start 
scheduling for the next pairing as in line 10, and the scheduling 
results for this pairing will be outputted in line 25. 

For the example in Section IIII-BI with the transmission 
paths selected by Algorithm [T] Algorithm |2] gives the same 
schedule as YALMIP HO). However, since the outer while loop 
has |Eb| iterations, which at most is |U|, and the inner while 
loop and for loop have |E*| iterations, which is at most [n/2j, 
our algorithm has the computational complexity of C1(|U|^), 
which is much lower than YALMIP. 

VI. Performance Evaluation 

In this section, we evaluate the delay and throughput of our 
proposed popular content downloading scheme under various 
traffic patterns. We compare it with two existing schemes, and 
also investigate the impact of the maximum number of hops 
Hmax on the performance of our scheme. 

A. Simulation Setup 

In the simulation, we consider a typical mmWave small cell 
of an AP and ten UEs. We assume the AP is located in the 
center of a square area with 10m x 10m, and the ten UEs 
are uniformly distributed in the area. Adopting the simulation 
parameters in Table II of ifT^ . the duration of a time slot is 
set to 5 pLS. The packet size is set to 1000 bytes. According 
to the distances between nodes, we set three transmission 
rates, 2 Gbps, 4 Gbps, and 6 Gbps. With a transmission rate 
of 2 Gbps, a packet can be transmitted in a time slot na. 
The AP obtains the common packets from the upper layer 
or pushes the schedule to UEs in one time slot E). Eor the 
simulated small cell, it takes a few time slots for the AP to 
compute the transmission paths and transmission schedule. In 
the simulation, we assume nonadjacent links are able to be 
scheduled for concurrent transmissions. 

Eor the downloading traffic, we adopt two modes: 


1) Poisson Process: The packets arrive following a Poisson 
process with arrival rate A. The traffic load, denoted by Tp, is 
defined as 

AxixIUI 


T,= 


R 


where L denotes the size of data packets, |1[J| denotes the 
number of UEs, and R is set to 2 Gbps. 

2) Interrupted Poisson Process (IPP): The packets arrive 
following an interrupted Poisson process (IPP). The param¬ 
eters of the interrupted Poisson process are Ai, A 2 , pi and 
P 2 , and the arrival intervals of an IPP obey the second-order 
hyper-exponential distribution with a mean of 


E{X) = ^ + ^. (19) 

Ai A2 

Since the IPP can also be represented by an ON-OEE process, 
IPP traffic is typical bursty traffic. The traffic load in this 
mode is defined as 


= 


L X |U| 


( 20 ) 


E{X) X R' 

The simulation length is 10® time slots, and the delay 
threshold is set to 2.5 x 10"^ time slots. The packets with delay 
larger than the threshold are discarded by UEs. We evaluate 
the system by the following three performance metrics: 


Algorithm 2: Concurrent Transmission Scheduling. 

r Input: The set of selected transmission paths, Pb; 

2 The set of hops in Pf,, Eb; 

3 The number of hops for each path p € Pb, Hp 

4 The weight of each hop {p, h) € Eb, Wph', 

5 Initialization: Set Fp = 1 for each p € Pb; t=0; 

6 while |Eb| >0 do 

7 

8 
9 

to 
ri 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 


t=t+ 1 ; 

Set = 0, E‘ = 0, and S* = 0; 

Set P^ with P^ = Pb; 

while |P^| > 0 and |E*| < [n/2j do 

Obtain the set of unvisited paths with the largest 
number of unscheduled hops, ¥mh', 

Obtain the hop (p, Fp) of path p G Pmh with the 
largest weight, WpFp', 
if SpFp 4- V* and VpF^ ^ V* then 

E‘ = E‘ U {{p,Fp)y, V‘ = V* U {spF,,rpF,}; 
for each link (p, h) in E* do 

Calculate the SINR of link (p, h), Tph', 
if Epb < licph) then 
1^ Go to line 23 

(j* = max(b*, WpFj,), Eb = Eb - (p, Fp); 

if Fp == Hp then 


|_ Pb = Pb - p; 

Fp = Fp + 1; Go to line 24 
E‘ =E‘-{(p,Fp)}; V‘ =\ 

K = K- 

Output E* and b*; 

26 Return E* and <5* for each pairing. 


' - {SpF^HpF^y, 
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1) Average Transmission Delay: The average traffic down¬ 
loading delay from the AP to UEs, which is in units of time 
slots. 

2) Network Throughput: The number of successfully 
transmitted packets to all UEs until end of the simulation. 
With the constant simulation length and fixed packet size, 
the total number of successfully transmitted packets is a good 
indication to show the throughput performance. 

3) D2D Ratio: The fraction of packets transmitted by 
device-to-device links over the total number of successfully 
transmitted packets. This metric is used to evaluate the role of 
D2D communications in the traffic downloading. 

In the simulation, we compare our scheme with the follow¬ 
ing two transmission schemes: 

1) SETS: The serial broadcasting transmission scheme, 
where the packets are transmitted to UEs from the AP serially 
without exploiting the D2D communications. 

2) FDMAC-H: In EDMAC-H, the concurrent transmission 
scheduling algorithm borrows the design ideas of the greedy 
coloring (GC) algorithm in EDM AC ifTSl . and performs edge 
coloring iteratively on the first unscheduled hops of trans¬ 
mission paths in a non-increasing order of weight with the 
conditions for concurrent transmissions satisfied. After each 
coloring, the set of first unscheduled hops is updated since the 
hops after these scheduled links become the first unscheduled 
hops. In EDMAC-H, the transmission path selection algorithm 
is the same as PCDS, and the detailed pseudo-code illustration 
can be found in ll^ . 

B. Comparison with the Optimal Solution 

To show the gap between the concurrent transmission 
scheduling algorithm and the optimal solution of the MILP, we 
first compare PCDS (Section V.C.) with the optimal solution of 
the MILP (Section V.B.). Since obtaining the optimal solutions 
is time-consuming, we simulate a smaller scenario of six UEs. 
All other simulation parameters are as defined in Section lVI-AI 

We plot the delay and throughput comparison of PCDS and 
the optimal solution under Poisson traffic in Pig. Prom 
the results, we can observe that the gap between the delay 
of PCDS and the optimal solution is small under light load, 
and the gap increases slowly with the traffic load. The gap 
is only about 17.2% at the traffic load of 3.33. In terms of 
network throughput, the gap is also very small. The gap is 
only about 2.8% at the traffic load of 3.33. Therefore, we have 
demonstrated that PCDS achieves near-optimal performance in 
terms of concurrent transmission scheduling. 




(a) Average transmission delay (b) Network throughput 
Fig. 2. Delay and throughput comparison of Optimal Solution and PCDS. 

We also plot the average execution time of PCDS and 
the optimal solution under Poisson traffic in Pig. [3] We can 


observe that the optimal solution takes much longer to execute 
than PCDS, and the gap increases with increasing traffic load, 
which indicates that PCDS has much lower computational 
complexity. 



Fig. 3. Execution time comparison of Optimal Solution and PCDS. 

C. Comparison with Existing Schemes 

We compare the delay and throughput of SETS, PDMAC- 

H, and PCDS under different traffic patterns. The maximum 
number of hops of transmission paths in Algorithm [T] Hmax 
is set to 4. 

1) Delay: We plot the average transmission delay of three 
schemes under different traffic loads in Pig. ID We can observe 
that the delay curves of the three protocols rise slowly under 
light traffic loads. The average delay of SETS and PDMAC- 
H begin to increase rapidly when the traffic load exceeds 

I. 5 and 2, while the delay of PCDS doesn’t have obvious 
growth until the traffic load is greater than 3. Under heavy 
loads, PCDS outperforms SETS and EDMAC-H significantly. 
With the traffic loads varying from 3 to 5, PCDS reduces the 
average delay by about 69.2% and 68.6% under Poisson and 
IPP traffic compared with EDMAC-H, respectively. Compared 
with SETS, PCDS reduces the average delay by about 75.5% 
and 75.5% under Poisson and IPP traffic, respectively. In 
SETS, since the links from the AP to UEs are adjacent, 
spatial reuse cannot be exploited. Purthermore, for UEs that 
have low transmission rates from the AP, their downloading 
suffers from higher delay. However, by D2D transmissions, 
PCDS allows UEs to receive packets from other UEs that 
have received the packets as well as the AP, and the low 
rate links from the AP to UEs are broken up into multiple 
hops of high rate, where more efficient spatial reuse can 
be exploited to improve transmission efficiency and reduce 
delay. The reason that PCDS outperforms EDMAC-H is that 
our concurrent transmission scheduling algorithm computes 
schedules with higher efficiency for the transmission paths 
selected by Algorithm [T] 

2) Throughput: The network throughput achieved by the 
three schemes are plotted in Pig. |5] All three protocols have 
similar performance under light loads, and the throughput 
increases almost linearly with the traffic load. This is due to 
the fact that almost all arriving packets are distributed to all 
UEs successfully within the simulation duration. When the 
traffic load exceeds 1.5 and 2, the increases of throughput 
under SETS and EDMAC-H fail to keep up with the traffic 
load, and the throughput curves begin to drop when the traffic 
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Fig. 4. Average transmission delay of the three transmission schemes under 
different traffic loads. 


load is over 2 and 2.5, respectively. However, the curves of 
PCDS only begin to drop at the traffic load of 4. Compared 
with SETS, PCDS increases the network throughput by about 
282.5% on average with the traffic load varying from 3 to 5 un¬ 
der Poisson traffic, and about 275.1% under IPP traffic. PCDS 
also increases the network throughput by about 107.2% on 
average with the traffic load varying from 3 to 5 under Poisson 
traffic compared with FDMAC-H, and about 98.5% under IPP 
traffic. With the increase of the traffic load, the delay increases, 
and a considerable number of packets cannot be transmitted 
within the simulation duration. In addition, there are also more 
packets that are not counted as successful transmissions due to 
their delays exceeding the delay threshold. D2D transmissions 
enable the AP to offload a part of traffic distribution to UEs, 
and thus the potential of concurrent transmissions is fully 
unleashed. 

3) Complexity: In terms of complexity, the SETS scheme 
has much lower complexity compared with PCDS, but its 
performance is much poorer. FDMAC-H has the same trans¬ 
mission path selection algorithm as PCDS, which is Cl(|Up). 
For the transmission scheduling algorithm, FDMAC-H has the 
complexity of Cl(|Up), which is the same as that of PCDS. 
Therefore, FDMAC-H has the same complexity as PCDS, but 
its performance is poorer than PCDS. 



Fig. 5. Network thi'oughput of the three transmission schemes under different 
traffic loads. 


D. Impact of the Maximum Number of Hops 

In this Section, we investigate the performance of PCDS 
under different maximum number of hops in the algorithm 
of transmission path selection, Hmax, and denote PCDS with 
Hmax equal to 2, 3, and 4 by PCDS-2, PCDS-3, and PCDS-4, 
respectively. 

In Fig. |6] we plot the average transmission delay of PCDS 
with different Hmax- Under light traffic loads, PCDS-4 and 
PCDS-3 have similar performance, but outperform PCDS-2 
significantly. Under heavy loads between 3 and 5, PCDS-4 has 
more significant advantages in terms of delay. PCDS-4 reduces 


the delay by about 62.1% on average with the traffic load 
between 3 and 5 under Poisson traffic compared with PCDS- 
3, and by about 71% compared with PCDS-2. With larger 
Hmax, the links of low channel quality from the AP to UFs 
are broken up into more hops, and concurrent transmissions 
can be exploited more fully to improve performance. 




(a) Poisson traffic (b) IPP traffic 

Fig. 6. Average transmission delay of PCDS with different Hmax- 

We plot the network throughput of PCDS with different 
Hmax in Fig. m We can observe that the throughput is 
consistent with the delay in Fig. |6] With the traffic load varying 
from 3 to 5, PCDS-4 increases the network throughput by 
about 55.5% on average compared with PCDS-3, and about 
57.9% compared with PCDS-2, respectively, under IPP traffic. 
With larger Hmax, more traffic downloading is offloaded from 
the AP to UFs, which exploits D2D transmissions with higher 
transmission rate to improve network throughput. 



(a) Poisson traffic (b) IPP traffic 

Fig. 7. Network throughput of PCDS with different Hmax- 


In Fig. |8] we plot the D2D ratios of PCDS with different 
Hmax- We can observe PCDS-4 and PCDS-3 have higher 
D2D ratios than PCDS-2, especially under heavy loads. Thus, 
D2D transmissions are exploited more fully and efficiently in 
PCDS-4 than PCDS-3 and PCDS-2, which is also consistent 
with the results in Fig. |6]and Fig. [T] 


Since each UF is only allowed as the downloading source of 
one neighboring UF once, the number of hops of the selected 
paths increases as a tolerance of 1 arithmetic progression start¬ 
ing from 1. If the number of UEs can be expanded to the sum 
of an arithmetic progression, the maximum number of hops of 

paths can be inferred as Otherwise, the maximum 

possible number of hops of the selected paths is 

Thus, although larger Hmax usually indicates better perfor¬ 
mance, the maximum number of hops of transmission paths 

in Algorithm [T| is limited by 


2 


. Therefore, Hmax 

should be selected according to actual network conditions in 
practice. 
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Traffic Load Traffic Load 

(a) Poisson traffic (b) IPP traffic 

Fig. 8. D2D Ratios of PCDS with different Hmax- 



[ 12 ] 

[13] 

[14] 

[15] 


VII. Conclusion 

In this paper, we proposed PCDS for popular content 
downloading in mmWave small cells in the 60 GHz band, 
which exploits both D2D transmission in close proximity and 
concurrent transmissions to improve transmission efficiency. 
Transmission path selection is optimized for better use of 
D2D communications and concurrent transmissions in con¬ 
tent downloading. Then a concurrent transmission scheduling 
algorithm is designed to exploit spatial reuse to improve 
transmission efficiency. Finally, extensive simulations under 
various traffic patterns demonstrate PCDS reduces transmis¬ 
sion delay and improves network throughput significantly 
compared with other existing schemes, especially under heavy 
load. Compared with FDMAC-H, PCDS improves the network 
throughput by about 102.9% on average with the traffic load 
between 3 and 5. 


References 

[1] “Qualcomm data challenge,” accessed February 2014, 
http://www.qualcomm.com/media/documents/wireless-networks-risingmeet- 
1 OOOx-mobile-data-challenge. 

[2] C. H. Doan, S. Emami, D. A. Sobel, A. M. Niknejad, and R. W. Broder- 
sen, “Design considerations for 60 GHz CMOS radios,” IEEE Commu¬ 
nications Magazine, vol. 42, no. 12, pp. 132-140, Dec. 2004. 

[3] F. Gutierrez, S. Agarwal, K. Parrish, and T. S. Rappaport, “On-chip 
integrated antenna structures in CMOS for 60 GHz WPAN systems,” 
IEEE J. Selected Areas in Communications, vol. 27, no. 8, pp. 1367— 
1378, Oct. 2009. 

[4] T. S. Rappaport, J. N. Murdock, and F. GutieiTez, “State of the art in 
60-GHz integrated circuits and systems for wireless communications,” 
Proceedings of the IEEE, vol. 99, no. 8, pp. 1390-1436, Aug. 2011. 

[5] ECMC TC48, ECMA standard 387, “High rate 60 GHz PHY, MAC and 
HDMI PAL,” Dec. 2008. 

[6] IEEE 802.15 WPAN Millimeter Wave Alternative PHY Task Group 3c 
{TG3c). Available: http://www.ieee802 org/15/pub/TG3c.html. 

[7] Draft Standard for Information Technology-Telecommunications and 
Information Exchange Between Systems—Local and Metropolitan Area 
Networks-Specific Requirements-Part II: Wireless LAN Medium Access 
Control (MAC) and Physical Layer (PHY) Specifications-Amendment 
4: Enhancements for Very High Throughput in the 60 Ghz Band, IEEE 
P802.11ad/D9.0, Oct. 2012. 

[8] J. Wang, Z. Lan, C. Pyo, T. Baykas, C. Sum, M. Rahman, J. Gao, 

R. Funada, F. Kojima, H. Harada, and S. Kato, “Beam codebook based 
beamforming protocol for multi-Gbps millimeter-wave WPAN systems,” 
IEEE J. Selected Areas in Communications, vol. 27, no. 8, pp. 1390- 
1399, Oct. 2009. 

[9] Y. Tsang, A. Poon, and S. Addepalli, “Coding the beams: Improving 
beamforming training in mmwave communication system,” in Proc. of 
IEEE Global Telecommunications Conference, Houston, TX, Dec. 5-9, 
2011 . 

[10] Z. Xiao, L. Bai, and J. Choi, “Iterative Joint Beamforming Training 
with Constant-Amplitude Phased An'ays in Millimeter-Wave Commu¬ 
nications,” IEEE Communications Letters, vol. 18, no. 5, pp. 829-832, 
May 2014. 

[11] Y.-B. Ko, V. Shankarkumar, and N. F. Vaidya, “Medium access control 
protocols using directional antennas in ad hoc networks,” in Proc. 19th 
IEEE INFOCOM, Tel Aviv, Israel, Mar. 26-30, 2000. 


[16] 

[17] 

[18] 

[19] 

[ 20 ] 

[ 21 ] 

[ 22 ] 

': 23 ] 

[24] 

[25] 

[26] 

[27] 

[28] 

[29] 

[30] 

[31] 

[32] 

[33] 


S. Singh, F. Ziliotto, U. Madhow, E. M. Belding, and M. Rodwell, 
“Blockage and directivity in 60 GHz wireless personal ai'ea networks: 
From cross-layer model to multihop MAC design,” IEEE J. Sel. Areas 
Commun., vol. 27, no. 8, pp. 1400-1413, Oct. 2009. 

A. Finamore, M. Mellia, Z. Gilani, K. Papagiannaki, Y. Grunenberger, 
V. Eri'amilli, “Is there a case for mobile phone content pre-staging?,” 
in Proceedings of the ninth ACM conference on Emerging networking 
experiments and technologies, Santa Barbai'a, CA, Dec. 9-12, 2013. 

X. Lin, R. Ratasuk, A. Ghosh, et. al., “Modeling, analysis, and optimiza¬ 
tion of multicast Device-to-Device transmissions,” IEEE Transactions on 
Wireless Communications, vol. 13, no. 8, pp. 4346^359, Aug. 2014. 

I. K. Son, S. Mao, M. X. Gong, and Y. Li, “On frame-based scheduling 
for directional mmWave WPANs,” in Pt'oc. IEEE INFOCOM, Orlando, 
FL, Mar. 25-30, 2012. 

Y. Li, Z. Wang, D. Jin, and S. Chen, “Optimal mobile content 
downloading in Device-to-Device communication underlaying cellulai' 
networks,” IEEE Transactions on Wireless Communications, vol. 13, 
no. 7, pp. 3596-3608, Jul. 2014. 

Y. Niu, Y. Li, D. Jin, L. Su, and D. Wu, “Blockage robust and 
efficient scheduling for directional mm Wave WPANs,” IEEE Trans. Veh. 
TechnoL, vol. PP, no. 99, Jun. 2014. 

M. N. Islam, A. Sampath, A. Maharshi, O. Koymen, N. B. Mandayam, 
“Wireless backhaul node placement for small cell networks,” in 2014 
48th Annual Conference on Information Sciences and Systems (CISS), 
Princeton, NJ, Mar. 19-21, 2014. 

V. Chandrasekhar, J. G. Andrews, and A. Gatherer, “Femtocell networks: 
a survey,” IEEE Communications Magazine, vol. 46, no. 9, pp. 59-67, 
Sep. 2008. 

Y. Zhu, Z. Zhang, et. al., “Demystifying 60GHz outdoor picocells,” in 
Proceedings of the 20th Annual International Conference on Mobile 
Computing and Networking, Hawaii, HI, Sep. 7-11, 2014. 

J. Qiao, L. X. Cai, X. Shen, and J. Mark, “STDMA-based scheduling 
algorithm for concurrent transmissions in directional millimeter wave 
networks,” in Proc. IEEE ICC, Ottawa, Canada, Jun. 10-15, 2012. 

L. X. Cai, L. Cai, X. Shen, and J. W. Mark, “REX: a Randomized 
Exclusive Region based Scheduling Scheme for mm Wave WPANs with 
Directional Antenna,” IEEE Trans. Wireless Commun., vol. 9, no. 1, 
pp. 113-121, Jan. 2010. 

C. Sum, Z. Lan, R. Funada, J. Wang, T. Baykas, M. A. Rahman, 
and H. Harada, “Virtual Time-Slot Allocation Scheme for Throughput 
Enhancement in a Millimeter-Wave Multi-Gbps WPAN System,” IEEE 
J. Sel. Areas Commun., vol. 27, no. 8, pp. 1379-1389, Oct. 2009. 

C.-S. Sum, Z. Lan, M. A. Rahman, et. al., “A multi-Gbps millimeter- 
wave WPAN system based on STDMA with heuristic scheduling,” in 
Proc. IEEE GLOBECOM, Honolulu, HI, Nov. 30-Dec. 4, 2009. 

J. Qiao, L. X. Cai, and X. Shen, et. al., “Enabling multi-hop concurrent 
transmissions in 60 GHz wireless personal area networks,” IEEE Trans¬ 
actions on Wireless Communications, vol. 10, no. 11, pp. 3824-3833, 
Nov. 2011. 

M. X. Gong, R. J. Stacey, D. Akhmetov, and S. Mao, “A directional 
CSMA/CA protocol for mmWave wireless PANs,” in Proc. IEEE 
WCNC’IO, Sydney, Australia, Apr. 18-21, 2010. 

Q. Chen, X. Peng, J. Yang, and F. Chin, “Spatial reuse strategy in 
mmWave WPANs with directional antennas,” in Proc. 2012 IEEE 
GLOBECOM, Anaheim, CA, Dec. 3-7, 2012. 

Q. Chen, J. Tang, D. Wong, X. Peng, and Y. Zhang, “Directional 
cooperative MAC protocol design and performance analysis for IEEE 
802.11 ad WLANs,” IEEE Trans. Veh. Tech., vol. 62, no. 6, pp. 2667— 
2677, Jul. 2013. 

Y. Niu, Y. Li, D. Jin, L. Su, and D. Wu, “A Two Stage Approach for 
Channel Transmission Rate Aware Scheduling in Directional mmWave 
WPANs,” Wireless Communications and Mobile Computing, DOI: 
10.1002/wcm.2521. 

Y. Niu, C. Gao, Y. Li, L. Su, D. Jin, and A. V. Vasilakos, “Exploiting 
device-to-device transmissions in joint scheduling of access and back¬ 
haul for small cells in 60 GHz band,” to appear in IEEE Journal on 
Selected Areas in Communications. 

S. Singh, R. Mudumbai, and U. Madhow, “Distributed coordination with 
deaf neighbors: efficient medium access for 60 GHz mesh networks,” in 
Proc. IEEE INFOCOM (San Diego, CA), Mar. 14-19, 2010, pp. 1-9. 
E. Shihab, L. Cai, and J. Pan, “A distributed asynchronous directional- 
to-directional MAC protocol for wireless ad hoc networks,” IEEE Trans. 
Veh. Tech., vol. 58, no. 9, pp. 5124^5134, Nov. 2009. 

J. Ning, T.-S. Kim, S. V. Krishnamurthy, and C. Cordeiro, “Directional 
neighbor discovery in 60 GHz indoor wireless networks,” in Proc. ACM 
MSWiM ’09, Tenerife, Spain, Oct. 26-30, 2009. 


















12 


[34] H. Deng and A. Sayeed, “Mm-wave MIMO channel modeling and 
user localization using sparse beamspace signatures,” in 2014 IEEE 
15th International Workshop on Signal Processing Advances in Wireless 
Communications (SPAWC), Toronto, Canada, Jun., 2014, pp. 130-134. 

[35] S. Y. Geng, J. Kivinen, X. W. Zhao, and P. Vainikainen, “Millimeter- 
wave propagation channel characterization for short-range wireless com¬ 
munications,” IEEE Trans. Veh. Technol, vol. 58, no. 1, pp. 3-13, Jan. 
2009. 

[36] H. Xu, V. Kukshya, and T. S. Rappaport, “Spatial and temporal char¬ 
acteristics of 60 GHz indoor channels,” IEEE J. Sel. Areas Commiin., 
vol. 20, no. 3, pp. 620-630, Apr. 2002. 

[37] R. C. Daniels and R. W. Heath, “60 GHz wireless communications: 
Emerging requirements and design recommendations,” IEEE Veh. Tech. 
Mag., vol. 2, no. 3, pp. 41-50, Sep. 2007. 

[38] Y. Niu, C. Gao, Y. Li, L. Su, and D. Jin, “Exploiting multi-hop relaying 
to overcome blockage in directional mmWave small cells,” to appear in 
Journal of communications and networks. 

[39] H. D. Sherali and W. P. Adams, A Reformulation-Linearization Tech¬ 
nique for Solving Discrete and Continuous Nonconvex Problems. Boston, 
MA: Kluwer Academic, 1999. 

[40] YALMIP YALMIP Wiki [Online]. Available: 

http://users.isy.liu.se/johanl/yalmip/ 



Yong Niu received B.E. degree in Beijing Jiaotong 
University, China in 2011. He is currently work¬ 
ing toward his PhD degree at the Department of 
Electronic Engineering, Tsinghua University, China. 
His research interests include millimeter wave com¬ 
munications, medium access control, and software- 
defined networks. 



Li Su received the B.S. degree from Nankai Uni¬ 
versity, Tianjin, China, in 1999 and Ph.D. degree 
from Tsinghua University, Beijing, China in 2007 
respectively both in electronics engineering. Now he 
is a research associate with Department of Electronic 
Engineering, Tsinghua University. His research in¬ 
terests include telecommunications, future internet 
architecture and on-chip network. 



Chuhan Gao is an undergraduate student from 
Tsinghua University, Beijing, China. He is cun'ently 
working toward the B.E. degree. His research in¬ 
terests include millimeter-wave wireless commu¬ 
nications, wireless networks, and software-defined 
networks. 


Yong Li (M’2009) received his B.S. degree in Elec¬ 
tronics and Information Engineering from Huazhong 
University of Science and Technology, Wuhan, 
China, in 2007, and his Ph.D. degree in elec¬ 
tronic engineering from Tsinghua University, Bei¬ 
jing, China, in 2012. During July to August in 
2012 and 2013, he worked as a Visiting Research 
Associate in Telekom Innovation Laboratories (T- 
labs) and HK University of Science and Technology, 
respectively. During December 2013 to March 2014, 
he visited University of Miami, EL, USA as a Visit¬ 
ing Scientist. He is cuiTently a faculty member of the Electronic Engineering 
at the Tsinghua University. 

His research interests are in the areas of networking and communications, 
including mobile opportunistic networks, device-to-device communication, 
software-defined networks, network virtualization, future Internet, etc. He 
received Outstanding Postdoctoral Researcher, Outstanding Ph. D Graduates 
and Outstanding Doctoral thesis of Tsinghua University, and his research is 
granted by Young Scientist Fund of Natural Science Foundation of China, 
Postdoctoral Special Find of China, and industry companies of Hitachi, ZET, 
etc. He has published more than 100 research papers and has 10 granted 
and pending Chinese and International patents. His Google Scholar Citation 
is about 440 with H-index of 11, also with more than 120 total citations 
without self-citations in Web-of Science. He has served as Technical Program 
Committee (TPC) Chair for WWW workshop of Simplex 2013, served as the 
TPC of several international workshops and conferences. He is also a guest- 
editor for ACM/Springer Mobile Networks & Applications, Special Issue 
on Softwai'e-Defined and Virtualized Future Wireless Networks. Now, he is 
the Associate Editor of EURASIP journal on wireless communications and 
networking. 




Depeng Jin received the B.S. and Ph.D. degrees 
from Tsinghua University, Beijing, China, in 1995 
and 1999 respectively both in electronics engineer¬ 
ing. 

He is an associate professor at Tsinghua University 
and vice chair of Department of Electronic Engi¬ 
neering. Dr. Jin was awarded National Scientific and 
Technological Innovation Prize (Second Class) in 
2002. His research fields include telecommunica¬ 
tions, high-speed networks, ASIC design and future 
Internet architecture. 


Zhu Han (S’01-CM’04-SM’09-F’14) received the 
B.S. degree in electronic engineering from Tsinghua 
University, Beijing, China, in 1997, and the M.S. 
and Ph.D. degrees in electrical engineering from the 
University of Maiyland, College Park, MD, USA, in 
1999 and 2003, respectively. 

From 2000 to 2002, he was an R&D Engineer 
with JDSU, Germantown, MD, USA. From 2003 
to 2006, he was a Research Associate with the 
University of Maryland. From 2006 to 2008, he was 
an Assistant Professor with Boise State University, 
Boise, ID, USA. He is currently an Associate Professor with the Department 
of Electrical and Computer Engineering, University of Houston, Houston, 
TX, USA. His research interests include wireless resource allocation and 
management, wireless communications and networking, game theory, wireless 
multimedia, security, and smart grid communication. He serves as an Asso¬ 
ciate Editor for IEEE Transactions on Wireless Communications since 2010. 
He received IEEE Fred W. Ellersick Prize in 2011 and the NSF CAREER 
award in 2010. 







